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1.  Contract  Information 


(a)  Title:  Speech  Recognition:  Acoustic,  Phonetic,  and  Lexical  Knowledge  Representation 
fb)  ONR  Contract  No.:  N00014-82-K-0727 

(c)  Principal  Investigator:  Victor  W.  Zue 

(d)  ONR  Scientific  Ofllcor:  Cmdr.  Allen  Sears,  DARPA-IPTO 

(e)  Period  Covered:  1  October  1984  to  30  September  198S 

2.  Research  Description 


(a)  Description  of  itesearch: 

/Our  long-term  research  goal  is  the  development  and  implementation  of  speaker-independent 
continuous  speech  recognition  systems.  It  is  our  conviction  that  proper  utilisation  of 
speech-specific  knowledge  is  essential  for  advanced  speech  recognition  systems.  With 
this  in  mind,  we  have  continued  to  make  progress  on  the  acquisition  of  acoustic-phonetic 
and  lexicnl  kncwlcdc 


tesults  in  the  Last  Year: 

'Wc  have  completed  the  development  of  a  continuous  digit  recognition  system.  The 
system  was  constructed  to  investigate  the  utilisation  of  acoustic  phonetic  knowledge 
in  a  speech  recognition  system.  Some  of  the  significant  development  of  this  study 
includes  a  '‘soft-failure’^  procedure  for  lexical  access,  and  the  discovery  of  a  set  of 
acoustic-phonetic  features  for  verification. 

We  have  completed  a  study  of  the  constraints  provided  by  lexical  stress  on  word 
recognition.  We  found  that  lexical  stress  information  alone  can,  on  the  average, 
reduce  the  number  of  word  candidates  from  a  large  dictionary  by  more  than  80%.  In 
conjunction  with  this  study,  we  successfully  developed  a  system  that  automatically 

determines  the  stress  pattern  of  a  word  from  the  acoustic  signal.  - 

We  have  performed  an  acoustic  study  on  the  characteristics  of  nasal  consonants  and 
nasalised  vowels.  We  have  also  developed  recognition  algorithms  for  nasal  murmurs 
and  nasalised  vowels  in  continuous  speech.  _ 

We  have  finished  the  preliminary  development  of  a  system  that  aligns  a  speech 
waveform  with  the  corresponding  phoDctic  transcription. 


(c)  Plans  for  Next  Year’s  Research: 

•  We  will  continue  our  investigation  of  the  constraints  imposed  by  the  language  on 
permissible  sound  patterns.  In  particular,  we  will  investigate  in  greater  detail  the 
role  played  by  lexical  stress.  We  will  also  expand  our  study  of  lexical  constraints  to 
iuclude  larger  linguistic  units  such  as  syllables  and  metrical  feet. 

•  We  will  incorporate  the  knowledge  gained  through  these  studies  into  recognition 
systems.  Specifically,  we  will  try  to  improve  the  performance  of  the  lexical  stress 
recognition  system,  and  we  will  continue  our  development  of  a  broad  phonetic  clas¬ 
sifier,  with  emphasis  on  handling  continuous  speech. 
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•  Wc  will  investigate  the  feasibility  of  constructing  a  knowledge-based  system  that 
attempts  to  mimic  the  procedure  of  spectrogram  reading. 

(d)  Presentations: 

A.  M.  Anil  and  V.  W.  Zue,  ‘Lexical  Stress  and  Its  Application  in  Large  Vocabulary 
Speech  Recognition,”  108th  Meeting  of  the  Acoustical  Society  of  America,  Minneapolis, 
MN,  Oct.  1984. 

J.  R.  Glass  and  V.  W.  Zue,  ‘An  Acoustic  Study  of  Nasal  Consonants  u  American  En¬ 
glish,”  108th  Meeting  of  the  Acoustical  Society  of  America,  Minneapolis,  MN,  Oct.  1984. 

A.  M.  Anil  and  V.  W.  Zue,  ‘Lexical  Stress  and  Its  Application  in  Large  Vocabulary 
Speech  Recognition,”  ICASSP  85,  Tampa,  FL,  March  1985. 

J.  R.  Glass  and  V.  W.  Zue,  ‘Detection  of  Nasalised  Vowels  in  American  English,” 
ICASSP  85,  Tampa,  FL,  March  1985. 

K.  Moore  and  V.  W.  Zue,  ‘The  Effect  of  Speech  Rate  on  the  Application  of  Low-Level 
Phonological  Rules  in  American  English,”  109th  Meeting  of  the  Acoustical  Society  of 
America,  Dallas,  TX,  April  1985. 

V.  W.  Zue  and  D.  S.  Cyphers,  “The  MIT  Spire  System,”  Speech  Tech  ’85,  New  York, 
NY,  April  1985. 

V.  W.  Zue,  ‘Human-Machine  Communication  by  Voice:  Computer  Recognition  of  Speech,” 
Annual  Meeting  of  American  Association  for  the  Advancement  of  Science,  Los  Angeles, 
CA,  May  1985. 

J.  R.  Glass  and  V.  W.  Zue,  ‘Analysis  and  Recognition  of  Nasal  Consonants  in  American 
English,"  IASTED  Robotics  and  Automation  Conference,  Lugano,  Switzerland,  June 
1985. 

H.  C.  Leung  and  V.  W.  Zue,  “Automatic  Alignment  of  Phonetic  Transcriptions  with 
Continuous  Speech,"  IASTED  Robotics  and  Automation  Conference.  Lugano,  Switzer¬ 
land,  June  1985. 

(e)  Publications: 


Papers 

A.  M.  Anil  and  V.  W.  Zue,  ‘Lexical  Stress  and  Its  Application  in  Large- Vocabulary 
Speech  Recognition,”  J.  Acoust.  Soc.  Amer.,  suppl.  1,  vol.  76,  Fall  1984,  p.  S47. 

A.  M.  Anil  and  V.  W.  Zue,  “Lexical  Stress  and  Its  Application  in  Large  Vocabulary 
Speech  Recognition,"  Proc.  ICASSP  65,  vol.  4,  pp.  1549-1552,  1985. 
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J.  R.  Glass  and  V.  W.  Zue,  'Acoustic  Characteristics  of  Nasal  Consonants  in  American 
English,”  J.  Acoutt.  Soc.  Amer.,  suppl.  1,  to).  76,  Fall  1984,  p.  SIC*. 

J.  R.  Glass  and  V.  W.  Zue,  'Detection  of  Nasalised  Voxels  in  American  English,”  Proc. 
ICASSP  85,  rol.  4,  pp.  1569-1572,  1985. 

V.  W.  Zue  and  D.  S.  Cyphers,  “The  MIT  Spire  System,”  Proe.  Speech  Tech  '85,  pp. 
277-279,  1985. 


These* 

F.  R.  Chen,  “Acoustic-Phonetic  Constraints  in  Continuous  Speech  Recognition:  A  Case 
Study  Using  the  Digit  Vocabulary,”  Ph.D.  thesis,  June  1985. 

D.  S.  Cyphers,  “Spire:  A  Research  Tool,”  S.M.  thesis,  May  1985. 

J.  R.  Glass,  “Nasal  Consonants  and  Nasalised  Vowels:  An  Acoustic  Study  and  Recog¬ 
nition  Experiment,”  S.M.  thesis,  December  1984. 

H.  C.  Leung,  “A  Procedure  for  Automatic  Alignment  of  Phonetic  Transcriptions  with 
Continuous  Speech,”  S.M.  thesis,  January  1985. 


(f)  Participants: 

Mark  Anderson 

Franriue  R.  Chen  (Ph.D.,  June  1985) 

D.  Scott  Cyphers  (S.M.,  May  1985) 

Janies  R.  Glass  (S.M.,  December  1984) 

Daniel  Huttculochcr 
Lori  F.  Lamel 

Niels  Lauritzen  (S.M.,  May  1985) 

Hong  0.  Leung  (S.M.,  January  1985) 

Kimberly  Moore 
Jacqueline  Vaissicre 
Victor  W.  Zue 

(g)  Other  Sponsored  Resea. 

•  Title:  Acoustic-Phonetics  Based  Speech  Recognition 
Sponsor:  Naval  Electronic  Systems  Command  ( DARPA  )| 
Amount:  $463,284.00 

•  Title:  Speech  Database  Development 
Sponsor:  Naval  Electronic  Systems  Command  (DARPA 
Amount:  $501,194.00 

•  Title:  Tools  for  Speech  Analysis  and  Research 

Sponsor:  Space  and  Naval  Warfare  Systems  Command  (DARPA) 
Amount:  $641,526.00 
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